pip install --upgrade pip
Requirement already satisfied: pip in c:\users\par20\anaconda3\lib\site-packages (22.3) Note: you may need to restart the kernel to use updated packages.
pip install Pandas-Profiling
Requirement already satisfied: Pandas-Profiling in c:\users\par20\anaconda3\lib\site-packages (3.3.0) Requirement already satisfied: matplotlib<3.6,>=3.2 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (3.3.4) Requirement already satisfied: requests<2.29,>=2.24.0 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (2.25.1) Requirement already satisfied: PyYAML<6.1,>=5.0.0 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (5.4.1) Requirement already satisfied: phik<0.13,>=0.11.1 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (0.12.2) Requirement already satisfied: missingno<0.6,>=0.4.2 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (0.5.1) Requirement already satisfied: pydantic<1.10,>=1.8.1 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (1.9.2) Requirement already satisfied: pandas!=1.4.0,<1.5,>1.1 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (1.2.4) Requirement already satisfied: joblib~=1.1.0 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (1.1.1) Requirement already satisfied: jinja2<3.2,>=2.11.1 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (2.11.3) Requirement already satisfied: visions[type_image_path]==0.7.5 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (0.7.5) Requirement already satisfied: tangled-up-in-unicode==0.2.0 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (0.2.0) Requirement already satisfied: tqdm<4.65,>=4.48.2 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (4.59.0) Requirement already satisfied: statsmodels<0.14,>=0.13.2 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (0.13.2) Requirement already satisfied: numpy<1.24,>=1.16.0 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (1.20.1) Requirement already satisfied: scipy<1.10,>=1.4.1 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (1.6.2) Requirement already satisfied: multimethod<1.9,>=1.4 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (1.8) Requirement already satisfied: htmlmin==0.1.12 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (0.1.12) Requirement already satisfied: seaborn<0.12,>=0.10.1 in c:\users\par20\anaconda3\lib\site-packages (from Pandas-Profiling) (0.11.1) Requirement already satisfied: networkx>=2.4 in c:\users\par20\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.5->Pandas-Profiling) (2.5) Requirement already satisfied: attrs>=19.3.0 in c:\users\par20\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.5->Pandas-Profiling) (20.3.0) Requirement already satisfied: Pillow in c:\users\par20\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.5->Pandas-Profiling) (8.2.0) Requirement already satisfied: imagehash in c:\users\par20\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.5->Pandas-Profiling) (4.3.1) Requirement already satisfied: MarkupSafe>=0.23 in c:\users\par20\anaconda3\lib\site-packages (from jinja2<3.2,>=2.11.1->Pandas-Profiling) (1.1.1) Requirement already satisfied: cycler>=0.10 in c:\users\par20\anaconda3\lib\site-packages (from matplotlib<3.6,>=3.2->Pandas-Profiling) (0.10.0) Requirement already satisfied: python-dateutil>=2.1 in c:\users\par20\anaconda3\lib\site-packages (from matplotlib<3.6,>=3.2->Pandas-Profiling) (2.8.1) Requirement already satisfied: kiwisolver>=1.0.1 in c:\users\par20\anaconda3\lib\site-packages (from matplotlib<3.6,>=3.2->Pandas-Profiling) (1.3.1) Requirement already satisfied: pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.3 in c:\users\par20\anaconda3\lib\site-packages (from matplotlib<3.6,>=3.2->Pandas-Profiling) (2.4.7) Requirement already satisfied: pytz>=2017.3 in c:\users\par20\anaconda3\lib\site-packages (from pandas!=1.4.0,<1.5,>1.1->Pandas-Profiling) (2021.1) Requirement already satisfied: typing-extensions>=3.7.4.3 in c:\users\par20\anaconda3\lib\site-packages (from pydantic<1.10,>=1.8.1->Pandas-Profiling) (3.7.4.3) Requirement already satisfied: urllib3<1.27,>=1.21.1 in c:\users\par20\anaconda3\lib\site-packages (from requests<2.29,>=2.24.0->Pandas-Profiling) (1.26.4) Requirement already satisfied: idna<3,>=2.5 in c:\users\par20\anaconda3\lib\site-packages (from requests<2.29,>=2.24.0->Pandas-Profiling) (2.10) Requirement already satisfied: chardet<5,>=3.0.2 in c:\users\par20\anaconda3\lib\site-packages (from requests<2.29,>=2.24.0->Pandas-Profiling) (4.0.0) Requirement already satisfied: certifi>=2017.4.17 in c:\users\par20\anaconda3\lib\site-packages (from requests<2.29,>=2.24.0->Pandas-Profiling) (2020.12.5) Requirement already satisfied: patsy>=0.5.2 in c:\users\par20\anaconda3\lib\site-packages (from statsmodels<0.14,>=0.13.2->Pandas-Profiling) (0.5.3) Requirement already satisfied: packaging>=21.3 in c:\users\par20\anaconda3\lib\site-packages (from statsmodels<0.14,>=0.13.2->Pandas-Profiling) (21.3) Requirement already satisfied: six in c:\users\par20\anaconda3\lib\site-packages (from cycler>=0.10->matplotlib<3.6,>=3.2->Pandas-Profiling) (1.15.0) Requirement already satisfied: decorator>=4.3.0 in c:\users\par20\anaconda3\lib\site-packages (from networkx>=2.4->visions[type_image_path]==0.7.5->Pandas-Profiling) (5.0.6) Requirement already satisfied: PyWavelets in c:\users\par20\anaconda3\lib\site-packages (from imagehash->visions[type_image_path]==0.7.5->Pandas-Profiling) (1.1.1) Note: you may need to restart the kernel to use updated packages.
import numpy as np
import pandas as pd
from pandas_profiling import ProfileReport
import seaborn as sns
import matplotlib.pyplot as plt
%matplotlib inline
df_medicare_covid = pd.read_csv("C:/Users/par20/Downloads/COVID_19_Hospitalization_Trends_Report_Data_file_20220526/COVID_19_Hospitalization_Trends_Report_Data_file_20220526.csv")
df_medicare_covid.head()
| Year | Month | Bene_Geo_Desc | Bene_Mdcd_Mdcr_Enrl_Stus | Bene_Race_Desc | Bene_Sex_Desc | Bene_Mdcr_Entlmt_Stus | Bene_Age_Desc | Bene_RUCA_Desc | Total_Bene_Hosp | Total_Mth_Enrl | Total_Bene_Enr_Hosp_Per100K | AVG_los | Pct_Dschrg_SNF | Pct_Dschrg_Expired | Pct_Dschrg_Home | Pct_Dschrg_Hspc | Pct_Dschrg_HomeHealth | Pct_Dschrg_Other | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2020 | Overall | National | All | All | All | All | All | All | 983879.0 | 6.251189e+07 | 1573.9070 | 10.5787 | 0.1926 | 0.1799 | 0.3571 | 0.0493 | 0.1532 | 0.0679 |
| 1 | 2020 | Overall | National | All | All | All | All | All | Rural | 189003.0 | 1.206914e+07 | 1566.0028 | 9.7035 | 0.1741 | 0.1755 | 0.3941 | 0.0391 | 0.1443 | 0.0728 |
| 2 | 2020 | Overall | National | All | All | All | All | All | Urban | 792524.0 | 4.981956e+07 | 1590.7888 | 10.7810 | 0.1970 | 0.1809 | 0.3483 | 0.0517 | 0.1555 | 0.0667 |
| 3 | 2020 | Overall | National | All | All | All | All | All | Unknown | NaN | 6.231943e+05 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 2020 | Overall | National | All | All | All | All | 0-64 | All | 130239.0 | 8.319817e+06 | 1565.4070 | 11.5841 | 0.1475 | 0.1195 | 0.4955 | 0.0145 | 0.1305 | 0.0926 |
Report of the dataset
EDA_df_medicare_covid = ProfileReport(df_medicare_covid, title="EDA_df_medicare_covid")
EDA_df_medicare_covid
EDA_df_medicare_covid.to_file(output_file="EDA_df_medicare_covid.html")